Iterative Learning Procedure With Reinforcement for High-Accuracy Force Tracking in Robotized Tasks
نویسندگان
چکیده
منابع مشابه
Tracking in Reinforcement Learning
Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the environment of the learning agent can be considered as stationary, generalized policy iteration frameworks, because of the interleaving of learning and control, will produce non-stationarity of the evaluated policy a...
متن کاملA Framework for Robotized Teleoperated Tasks
Teleoperation systems allow the extension of the human operator’s sensing and manipulative capability into a remote environment to perform tasks at a distance, but the time-delays in the communications affect the stability and transparency of such systems. This work presents a teleoperation framework in which some novel tools, such as nonlinear controllers, relational positioning techniques, ha...
متن کاملPolicy Improvement through Safe Reinforcement Learning in High-Risk Tasks
Reinforcement Learning (RL) methods are widely used for dynamic control tasks. In many cases, these are high risk tasks where the trial and error process may select actions which execution from unsafe states can be catastrophic. In addition, many of these tasks have continuous state and action spaces, making the learning problem harder and unapproachable with conventional RL algorithms. So, whe...
متن کاملTwo Steps Reinforcement Learning in Continuous Reinforcement Learning Tasks
Two steps reinforcement learning is a technique that combines an iterative refinement of a Q function estimator that can be used to obtains a state space discretization with classical reinforcement learning algorithms like Q-learning or Sarsa. However, the method requires a discrete reward function that permits learning an approximation of the Q function using classification algorithms. However...
متن کاملReinforcement Learning with Unsupervised Auxiliary Tasks
Deep reinforcement learning agents have achieved state-of-the-art results by directly maximising cumulative reward. However, environments contain a much wider variety of possible training signals. In this paper, we introduce an agent that also maximises many other pseudo-reward functions simultaneously by reinforcement learning. All of these tasks share a common representation that, like unsupe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Industrial Informatics
سال: 2018
ISSN: 1551-3203,1941-0050
DOI: 10.1109/tii.2017.2748236